Sarah Strochak, Kyle Ueyama, Aaron R. Williams
What is 2 + 2?
What is 2 + 2?
## [1] 4
What is the median price of diamonds with carat > 1 and a Good cut?
What is the median price of diamonds with carat > 1 and a Good cut?
## # A tibble: 1 x 1
## `median(price)`
## <int>
## 1 6412
How could increasing the retirement age affect the poverty rates of Hispanic women ages 62 and older?
How could increasing the retirement age affect the poverty rates of Hispanic women ages 62 and older?
Deliberate steps should be taken to minimize the chance of making an error and maximize the chance of catching errors when errors inevitably occur.
Computational reproducibility should be embraced to improve accuracy, promote transparency, and prove the quality of analytical work.
Code should be written so humans can easily understand what’s happening—even if it occasionally sacrifices machine performance.
Analyses should be designed so strangers can understand each and every step without additional instruction or inquiry from the original analyst.
Research and data are non-rival and non-exclusive. They are public goods that should be widely and easily shared. Decisions about tools, methods, data, and language during the research process should be made in ways that promote the ability of anyone and everyone to access an analysis.
Analysts should seek to make all parts of the research process more efficient with clear communication, by adopting best practices, and by managing computation.
Photo by StataCorp LP, CC BY-SA 4.0, Unaltered
Source is unknown
R is an implementation of the S programming language, which was created at Bell Labs in the 1970s.
S-PLUS is a proprietary implementation of R that was common for years.
R is a free, open-soure programming language created by Ross Ihaka and Robert Gentleman at the Univesity of Aukland in the early 1990s.
R is mostly written in R, C, and FORTRAN.
The Comprehensive R Archive Network was introduced in 1997.
Repository of popular R packages with basic standards and quality control.
Comprehensive set of tools for data science
Core: ggplot2, dplyr, tidyr, readr, purrr, tibble, stringr, forcats
Free text by Hadley Wickham and Garrett Grolemund
IDE and for-profit company that funded and professionalized R development
.R and .Rmd“Good coding style is like correct punctuation: you can manage without it, butitsuremakesthingseasiertoread.” ~ Hadley Wickham
Collections of R, C, C++, and FORTRAN code that expand the functionality of R.
character
numeric
logical
factor
1: use it, use it again, use it some more.
Comments